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ABSTRACT 

The  likelihood  ratio  test  is  presented  as  a  natural  method  to  test  for 
the  presence  of  "always  buy"  and  "never  buy"  consumers.   The  purchase 
sequence  lengths  and  sample  sizes  required  to  estimate  the  proportions  of 
these  buyers  are  determined  through  the  use  of  simulated  data.   Our  method 
was  then  applied  to  two  sets  of  well  known  data  first  analyzed  by  Aaker  [1] 
and  Montgomery  [8].   The  empirical  findings  show  that  although  over  half 
of  the  consumers  never  bought  a  given  brand,  the  specific  addition  of  a 
spike  at  p=0  to  provide  for  "never  buy"  consumers  did  not  form  a  superior 
fit  over  the  no-spike  model. 


INTRODUCTION 

Many  stochastic  models  of  buyer  behavior  assume  a  zero-order  process. 
In  each  of  these  models,  consumers  are  assumed  to  have  different  but  constant 
over  time  purchase  probabilities.   As  a  result  of  this,  a  zero-order  model 
is  completely  specified  by  the  distribution  of  purchase  probabilities  across 
the  population  of  consumers.   The  beta  distribution,  due  to  its  flexibility 
and  tractability  has  been  frequently  used  as  a  functional  form  to  charac- 
terize this  heterogeneity  in  purchase  probability  across  consumers.   The 
parameters  of  the  purchase  probability  distribution  (e.g.,  two  parameters 
of  the  beta  distribution)  can  be  estimated  from  panel  data  which  contain 
brand  purchase  information  of  many  households  over  time. 

Often  the  panel  contains  many  households  who  have  never  purchased  a 
given  brand  and  some  households  who  have  bought  the  brand  on  every  purchase 
occasion.   Do  such  data  imply  that  the  purchase  probability  distribution 
has  mass  points  at  its  extremities  of  p=0  and/or  p=l  (p  denotes  brand  pur- 
chase probability)?   These  mass  points  (or  spikes)  would  represent  the  pro- 
portion of  "always  buy"  and  "never  buy"  consumers.   We  suggest  the  likelihood 
ratio  test  as  the  natural  method  to  statistically  test  for  and  estimate  the 
size  of  these  spikes. 

The  major  contribution  of  this  paper  is  to  determine  the  purchase 
sequence  lengths  and  sample  sizes  required  to  use  the  likelihood  ratio 
test  to  detect  and  estimate  spikes.   Virtually  all  of  the  published  data  use 
purchase  sequence  lengths  of  five  or  less  to  estimate  the  parameters  of  the 
purchase  probability  distribution,  but  even  with  very  large  sample  sizes 
this  length  is  simply  not  adequate  to  test  for  spikes.   In  order  to  illustrate 
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our  methodology  the  likelihood  ratio  test  is  used  to  estimate  the  spikes 
for  two  sets  of  empirical  data  that  have  been  used  in  published  research 
work  in  marketing,  namely  the  Aaker  data  [1]  for  a  frequently  bought  con- 
sumer good  and  Montgomery's  [8],  [9]  Crest  toothpaste  data. 

The  paper  is  organized  as  follows:   We  first  describe  the  three 
alternative  models  with  zero,  one  or  two  spikes.   We  then  present  the  like- 
lihood function  approach  to  estimate  the  model  parameters.   This  is  followed 
by  the  use  of  the  likelihood  ratio  test  to  detect  and  estimate  spikes. 
Purchase  sequence  length  and  sample  size  requirements  are  then  determined 
through  the  use  of  simulated  data.   The  application  of  the  likelihood  ratio 
test  to  the  empirical  data  is  presented  next.   The  end  results  are  some 
"rules  of  thumb"  for  sample  sizes  and  purchase  sequences  lengths  needed  in 
estimating  the  parameters  of  zero  order  models. 

THE  THREE  MODELS 

Let  f(p)  denote  a  specific  functional  form  of  the  purchase  probability 

distribution  for  a  particular  brand  (say.  Brand  1).   Each  consumer  has  some 

probability,  p,  of  purchasing  Brand  1  which  takes  a  value  between  zero  and 

one",  f(p)  denotes  the  density  of  consumers  who  have  that  purchase  probability. 

For  ease  of  reference  and  clearer  understanding  let  us  introduce  and  define 

the  following  symbols: 

N  =   Number  of  consumers  in  the  product  class. 

n.   =  Number  of  consumers  who  make  j  purchases  of  Brand  1  and  (k- j ) 
purchases  of  Brand  0  ('all  other'  aggregate  class)  given  k 
purchase  occasions. 

k  =   Length  of  purchase  string,  that  is,  the  total  number  of  occasions 
for  purchasing  Brand  1  or  0. 

p.   =   ,R.  =  Expected  probability  of  making  j  purchases  of  Brand  1  out 
J     "^  J 

of  a  total  of  k  purchases. 
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c.   =  Number  of  ways  in  which  j  objects  can  be  chosen  out  of  k  objects. 

f  (j|p,k)  =  Binomial  probability  of  making  j  purchases  on  k  trials  given 
purchase  probability  is  p. 

a  =  Proportion  of  consumers  who  'never  buy'  Brand  1. 

3  =  Proportion  of  consumers  who  'always  buy'  Brand  1. 

j=0,l,2...k. 

j   =  Average  purchase  frequency . 

Table  1  lists  in  summary  form  the  three  alternative  models  tested  in 
this  paper.   Along  with  a  pictorial  presentation.  Table  1  contains  a  brief 
description,  an  expression  for  the  probability  p.,  and  average  purchase 
frequency  for  the  three  models. 


PARAMETER  ESTIMATION 

We  require  the  maximum  value  of  the  likelihood  function  for  each 
alternative  model  to  perform  the  likelihood  ratio  test.   The  likelihood 
function  for  a  model  M(where  M=0,l,2)  given  purchase  frequency  data,  n.s, 
can  be  written  as: 


£(M)  =  £(nQ,  n^,  ...  n^;   t) 


N'  "O     "l        "k 

(Pn)    (P,)    ...  (Pj  ^  (1) 


n^!  n2!  ...  n^  !   "^0'    '^1'    '  "•  ^'^k 


where  M  =  model  identification  number, 
G  =  parameters  of  the  model,  M  . 

In  equation  (1)  the  p.'s  are,  of  course,  functions  of  the  parameter 
vector  6  (in  the  unconstrained  case  of  Model  2,  for  instance,  9  represents 
m,  n,  a  and  6)  as  shown  in  Table  1.  The  maximum  likelihood  estimates 
can  be  obtained  by  maximizing  £(M)  above  with  respect  to  6.   Since  maximizing 
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a  monotonic  transform  of  £.(M)  would  not  change  the  parameter  estimates  for 
which  the  maximum  is  attained,  we  replace  the  constant  term  in  -^(M)  by 
unity  and  work  with  the  logarithm  of  the  altered  likelihood  function  for 
computational  convenience.   Thus,  the  maximum  likelihood  estimate  of  6 
is  obtained  by  maximizing  L(M),  which  is  a  monotonic  transform  of  -^(M),  where: 

n^     n,        n. 


L(M)  =  log{(pQ)  °  (P^)  ^  ...  (p^)    ^   }, 


or, 


k 
L(M)  =  I  n  log  p 
j=0  ^      ^ 


The  probabilities  p . ( j  =  0,1,...  k)  in  the  expression  are  functions  of 
the  parameters  of  the  model  as  shown  in  Table  1.   For  instance,  in  the  case 
of  Model  2,  the  p.'s  would  depend  on  the  parameters  of  this  model  —  m,  n,  a 
and  3;  the  maximum  likelihood  estimates  of  the  parameters  are  obtained  by 
maximizing  L(2)  with  respect  to  m,  n,  a  and  6.   The  expression  for  L(M)  in 
equation  (2)  above  on  substitution  for  p.  from  Table  1,  becomes  quite  com- 
plex and,  hence,  an  analytical  derivation  of  the  maximum  likelihood  estimates 
of  the  parameters  is  not  feasible.   Therefore,  numerical  optimization  is  used 
to  obtain  the  values  of  the  parameters.   The  computer  program  developed  for 
this  purpose  —  a  modified  pattern  search  —  is  based  on  the  pattern  search 
process  developed  by  Hookes  and  Jeeves  [3].   The  program  provides  an  optimiza- 
tion procedure  for  determining  the  values  of  n  different  parameters  at  which 
a  given  function  of  these  n  parameters  attains  a  global  optimum  value.   This 
modified  pattern  search  program  was  developed  by  Kalwani  for  his  doctoral 
dissertation  [4]. 
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THE  LIKELIHOOD  RATIO  TEST 

The  likelihood  ratio  test  uses  the  maximum  value  of  the  likelihood 
function  for  each  of  the  two  models.   One  model  must  be  a  constrained 
version  of  the  other  model.   Model  2  which  contains  two  spikes  in  its  pur- 
chase probability  distribution  is  unconstrained  and  has  four  parameters 
(a,  3.  m  and  n) ;   two  of  these  four  parameters  are  the  parameters  of  the 
beta  distribution  (i.e.,  m  and  n)  and  the  other  two  parameters  represent 
the  two  spikes,  a  and  6.   Model  1  which  has  one  spike  is  a  constrained  form 
of  Model  2;  it  has  three  parameters  (ot,  m  and  n) .   For  Model  1  the  spike 
parameter  B,  representing  the  proportion  of  "always  buy"  consumers  is  con- 
strained to  take  on  the  value  zero.   In  the  case  of  Model  0,  both  the 
spike  parameters,  ot  and  3,  are  restricted  to  take  on  the  value  zero.   Clearly, 

Model  0   C   Model  1  C   Model  2. 

Therefore,   L*(0)  <   L*(l)  <  L*(2). 

Where  L*(M)  represents  the  maximum  value  of  the  monotonic  transform  of  the 
likelihood  function  (see  equation  (2))  for  Model  0,  1  or  2.  While  it  is 
obvious  that  L*(0)  is  less  than  L*(l)  or  L*(2),  the  question  is  whether  or 
not  this  difference  is  statistically  significant.   That  is,  if  we  set  up 
the  constrained  model  as  the  null  hypothesis  (H  )  and  the  unconstrained 
model  as  the  alternate  hypothesis  (H  ) ,  we  would  like  to  see  if  the  data 
suggest  that  we  reject  the  null  hypothesis  in  favor  of  the  alternate  hypothesis, 
The  likelihood  ratio  test  answers  this  question.   We  have: 
H  :   Model  0  (or  1)  with  00=2  (or  3)  parameters 
H^ :   Model  1  (or  2)  with  ^=3    (or  A)  parameters  . 
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Next,  we  form  a  ration  (say.  A)  of  the  maximum  value  of  the  likelihood  func- 
tion of  the  constrained  model  under  the  null  hypothesis  (say.  Model  0)  to 
that  for  the  unconstrained  model  (say.  Model  2). 
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For  large  samples,  Wilks  [11]  has  shown  that,  -2  log  A  is  distributed  chi- 

square  with  (fi-oj)  degrees  of  freedom.   In  testing  Model  2  against  Model  0, 

since  L*(M)  from  equation  (2)  represents  the  logarithm  of  the  maximum  value 
of  the  likelihood  function  for  Model  M,  we  have: 

-2  log^AQ2  =  -2[L*(0)-  L*(2)]   =  2 [L*(2)  -  L*(0) ]  (3) 

is  distributed  chi-square  with  2  (=4-2)  degrees  of  freedom. 

PURCHASE  SEQUENCE  LENGTH  AND  SAMPLE  SIZE 

In  this  section  we  determine  the  sample  size  and  the  purchase  sequence 
length  required  to  detect  spikes.   Obviously  a  sample  of  50  consumers  with 
three  purchases  each  is  not  sufficient  to  test  if  a  proportion  of  the  con- 
sumers in  the  population  are  "always  buy"  or  "never  buy"  customers.   On 
the  other  hand,  a  sample  of  10,000  consumers  with  20  purchases  is  clearly 
adequate  to  test  for  spikes.   It  is  not  obvious  what  purchase  sequence 
lengths  and  sample  sizes  are  required  to  detect  mass-points  in  purchase 
probability  distributions.   Also,  it  would  be  useful  to  know  the  trade-off 
between  sample  size  and  purchase  sequence  length  requirements  for  detecting 
and  estimating  spikes. 
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We  determine  these  requirements  through  the  use  of  simulated  data. 
We  generate  samples  from  eleven  different  settings  which  extend  over  the 
spectrum  of  zero-order  models.   The  first  five  settings  are  based  on  pur- 
chase probability  distributions  with  no  spikes.   The  probability  distribu- 
tion is  allowed  to  take  various  shapes  like  uniform,  bell,  "decay  curve," 
U  and  reverse  J.   These  five  settings  are  described  in  greater  detail  with 
the  actual  values  of  the  parameters  in  a  different  paper  by  the  authors 
(see  Kalwani  and  Morrison  [5]). 

The  next  three  settings  from  which  simulated  sample  data  are  generated 
contain  skewed  bell  shape  distributions  with  spikes  at  p=0.   The  spikes 
representing  "never  buy"  consumers  are  of  size  5%,  10%  arid  20%  respectively. 
The  last  three  settings  are  based  on  bell  shape  purchase  probability  dis- 
tributions with  two  spikes  at  p=0  and  p=l.   The  two  spikes  are  of  equal 
size  in  each  of  the  three  cases  and  are  of  size  5%,  10%  and  20%. 

Many  researchers  have  used  sample  data  with  purchase  sequence  lengths 
of  5  or  less  for  estimating  parameters  of  their  models.   There  has  been,  how- 
ever, considerable  variation  in  the  sample  sizes  used  for  parameter  estimation. 
To  start  with,  we  begin  by  generating  samples  of  size  500  with  a  purchase 
sequence  length  of  5  for  each  of  the  eleven  settings  (5  Model  0,  3  Model  1 
and  3  Model  2).   These  sample  specifications,  as  we  shall  see  in  the  next 
section,  prove  to  be  adequate  for  the  first  five  settings  of  models  with  no 
spikes.   For  the  one  and  two  spike  models,  however,  these  sample  specifica- 
tions turn  out  to  be  inadequate  and  we  repeat  the  analysis  by  increasing  the 
sample  size  to  1000  with  a  purchase  sequence  length  of  5  and  then  by  increas- 
ing the  purchase  sequence  length  to  10  for  samples  of  size  500. 
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The  testing  procedure  for  the  simulated  data  consists  of  generating  10 
samples  at  each  preset  sample  specification  (say,  sample  size  =  500  and  pur- 
chase sequence  length  =  5) ,   The  likelihood  ratio  test  is  performed  on  the 
10  simulations.   The  likelihood  ratio  test  determines  the  model  (no-spike, 
one-spike  or  two-spike)  which  forms  the  best  fit  for  the  simulated  sample 
data  obtained  from  each  of  the  10  simulation  runs.   From  this  testing  of 
the  simulated  data  we  determine  the  number  of  times  (out  of  10)  the  true 
model  (no-spike,  one-spike  or  two-spike)  that  is  used  to  generate  the  sample 
data  is  correctly  identified  as  providing  the  best  fit  for  the  data.   Sample 
specifications  in  terms  of  sample  size  and  purchase  sequence  length  are  ade- 
quate if  the  true  model  can  be  correctly  identified  most  of  the  time  with 
these  specifications. 

Let  us  now  analyze  the  findings  from  this  testing  of  the  simulated  data 
which  is  performed  at  a  confidence  level  of  95%.   Sample  specifications  of 
sample  size  =  500  and  purchase  sequence  length  =  5  are  found  to  be  adequate 
for  no-spike  models.   That  is,  for  the  sample  data  generated  from  the  first 
five  settings  (no-spike  models)  the  null  hypothesis  of  Model  0  with  no  spikes 
is  never  rejected  in  favor  of  the  alternate  hypothesis  of  Model  1  with  one 
spike  or  Model  2  with  two  spikes. 

Table  2  displays  the  results  obtained  in  the  case  of  one-spike  and 
two-spike  models.   As  mentioned  earlier  the  likelihood  ratio  test  is  per- 
formed for  three  different  sample  specifications  in  case  of  the  spiked 
models.   Under  each  of  the  three  sample  specifications  Table  2  displays  for 
each  setting  the  number  of  times  out  of  10  that  a  particular  model  provides 
the  best  fit  for  the  simulated  data.   The  number  of  times  for  which  the  true 
model  is  correctly  identified  is  underlined.   As  an  illustration  of  the  results 
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TABLE  2 


Setting  # 

Sample  Size  =  500 
Purch.  Seq. 
Length  =  5 

Sample  Size  =  1000 
Purch.  Seq. 
Length  =  5 

Sample  Size  =  500 
Purch.  Seq. 
Length  =  10 

Mod.  0 

Mod.  1 

Mod.  2 

Mod-  0 

Mod.  1 

Mod-  2 

Mod.  0 

Mod.  1 

Mod.  2 

6.  one-spike 
a  =  0.05 

9 

0 

1 

7 

3 

2 

8 

7.  one-spike 
a  =  0.10 

7 

3 

3 

I 

10 

8.  one-spike 
a  =  0.20 

A 

6 

8 

2 

10 

9.  two-spike 
a  =  0.05 
6  =  0.05 

8 

1 

1 

6 

4 

10 

10  two- spike 
a  =  0.10 
e  =  0.10 

3 

2 

5 

1 

9 

10 

11  two-spike 
a  =  0.20 
6  =  0.20 

3 

I 

10 

10 

displayed  in  Table  2,  the  findings  for  the  setting  7  show  that  with  the 
sample  specifications  of  size  500  and  purchase  sequence  length  of  5,  the 
true  model  (i.e..  Model  1  with  one  spike)  is  correctly  identified  only 
three  times  out  of  ten.   Doubling  the  sample  size  to  1000  with  the  same 
purchase  sequence  length  raises  the  number  of  correctly  identified  models 
to  7  whereas  doubling  the  purchase  sequence  length  leads  to  corrent  iden- 
tification of  the  true  model  all  the  ten  times.   The  remainder  of  Table  2 
can  be  interpreted  in  the  same  manner  for  settings  8  through  11. 
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Several  interesting  and  useful  conclusions  can  be  drawn  from  the 
findings  in  Table  2.   The  first  rather  obvious  inference  we  make  is  that 
it  is  easier  to  detect  larger  spikes  both  for  one-'spike  and  two-spike 
models. 

For  the  sample  specifications  of  size  500  and  purchase  sequence 
length  of  5,  the  number  of  times  the  true  model  is  correctly  identified 
are  9  (out  of  30)  for  the  one-spike  models  and  13  for  the  two-spike  models. 
Raising  the  sample  size  to  1000  (the  purchase  sequence  length  is  still  5) 
increases  these  numbers  to  18  and  23  respectively.   The  gain  in  spike 
detecting  power  is  much  greater  if  the  purchase  sequence  length  rather 
than  the  sample  size  is  doubled.   For  sample  size  =  500  and  purchase 
sequence  length  =  10,  the  number  of  times  the  true  model  is  detected  are 
28  for  the  one-spike  models  and  30  for  the  two-spike  models.   We  conclude 
that  doubling  the  purchase  sequence  length  from  5  to  10  gives  more  addi- 
tional power  to  detect  spikes  rather  than  doubling  the  sample  size  from 
500  to  1000. 

Overall  a  purchase  sequence  length  of  10  with  sample  size  of  500  or 
larger  is  adequate  to  identify  the  true  model  (58  out  of  60  times  for  models 
with  one  or  two  spikes) .   We  conclude  that  a  purchase  sequence  length  of 
10,  which  for  many  products  and  most  consumers  represents  purchases  of 
branded  goods  over  half  a  year,  represents  a  desirable  purchase  sequence 
length  to  detect  and  estimate  the  proportion  of  "always  buy"  and/or  "never 
buy"  consumers. 
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EMPIRICAL  DATA 

Description: 

The  empirical  data  used  In  this  paper  come  from  two  sources.   The 
first  data  set  Is  from  Aaker  [1]  who  obtained  It  from  the  MRCA  panel. 
This  sample  only  Includes  new  triers  of  a  particular  brand  of  a  frequently 
bought  good.   The  second  source  for  the  empirical  data  Is  from  Montgomery  [8] 
and  once  again  the  original  source  Is  the  MRCA  panel.   The  data  are  from 
the  dentrlflce  market  just  prior  to  and  Immediately  following  the  endorse- 
ment of  Crest  toothpaste  by  the  American  Dental  Association.   Both  the  before 
and  after  endorsement  data  are  used  In  this  paper.   Montgomery's  model  and 
a  description  of  the  data  can  be  found  In  Montgomery  [9] .   However,  the 
specific  numbers  In  Table  3  are  only  In  the  unpublished  working  paper, 
Montgomery  [8]. 

All  the  three  data  sets  are  for  a  purchase  sequence  length  of  5  and 
the  purchase  frequencies  are  displayed  In  Table  3.   In  view  of  the  findings 
from  the  simulated  data  in  the  previous  section  we  should  note  that  the  sam- 
ple size  of  631  (with  5  purchases  per  consumer)  for  Aaker 's  data  is  not  suf- 
ficient to  test  for  spikes. 

TABLE  3 


#  of  purchases 
of  Brand  1 


Aaker ' s  Data 


Montgomery's 
"Before  Endorse- 
ment" Data 


Montgomery' s 
"After  Endorse- 
ment" Data 


0 

1 
2 
3 
4 
5 


352 
125 
62 
38 
30 
24 


2,212 

383 

154 

91 

89 

106 


1,683 
436 
270 
203 
209 
234 


TOTAL 


631 


3,035 


3,035 


eeo,£ 


ctCt 


rrf'>. 
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Flndings; 

Our  objective  is  to  find  which  one  of  the  three  models  (namely  Models 
0,  1  or  2),  on  the  basis  of  the  likelihood  ratio  test,  provides  the  best 
fit.   In  order  to  perform  the  likelihood  ratio  test  we  need  to  calculate 
the  logarithm  of  the  maximum  value  of  the  likelihood  function  for  each  of 
the  three  models.   Computer  optimization  through  the  modified  pattern 
search  program  provides  the  logarithm  of  the  likelihood  function  for  each 
of  the  models  and  the  maximum  likelihood  estimates  of  the  parameters.   The 
results  from  these  computer  runs  are  displayed  in  Table  4. 

Table  4  displays  in  rows  I  the  maximum  likelihood  estimates  of  the  Model 

parameters  —  m,  n,  ot  and  6.   The  next  set  of  rows  II  contain  the  logarithm 

of  the  maximum  value  of  the  likelihood  function.   The  last  set  of  rows  III 

display  the  test  statistic,  -2  log  A,  for  each  of  the  data  sets.   For  instance, 

-2  log  A„„  is  the  test  statistic  when  Model  0  is  the  null  hypothesis  and 
e  02 

Model  2  is  the  alternate  hypothesis;  equation  (3)  is  used  to  compute  this 
test  statistic. 

Examination  of  the  test  statistic  values  reveals  that  for  Aaker's 
data  null  hypothesis  of  Model  0  is  not  rejected  in  favor  of  either  Model  1 
or  2;  we  conclude  that  Model  0  provides  the  best  fit  for  Aaker's  data  (but 
recall  the  inadequacy  of  his  sample  size).   In  the  case  of  Montgomery's 
"Before  Endorsement"  data  Models  0  and  1  are  rejected  in  favor  of  Model  2; 
hence  Model  2  forms  the  best  fit  to  the  second  data  set.   For  the  third  data 
set  Model  0  provides  the  best  fit  since  it  is  not  rejected  in  favor  of  Model 
1  or  2.   In  fact  it  is  a  little  disconcerting  that  for  this  third  set  of 
data  the  addition  of  spikes  has  absolutely  no  effect  on  the  likelihood  func- 
tions. 

The  most  significant  finding  in  Table  4  is  that  Model  1  does  not 

constitute  a  best  fit  in  any  of  the  three  cases  (see  test  statistic  values, 
-2  logA   ) ,   This  result  in  view  of  the  proportion  of  the  consumers  with 
no  purchases  of  Brand  1  is  nonintuitive  and  warrants  an  explanation.   For 
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the  three  data  sets  displayed  in  Table  3  the  proportions  representing  the 
consumers  who  make  0  purchases  of  Brand  1  are  56%,  72%  and  55%  respectively. 
An  examination  of  the  proportions  at  the  other  purchase  frequencies  shows 
that  the  shape  of  the  purchase  probability  distribution  representing  these 
data  sets  is  likely  to  be  either  U-shaped  or  concave  (when  m  and  n  are  both 
less  than  1)  or  a  shape  where  the  curve  steadily  declines  (when  ra<l<n) .   This 
observation  is  confirmed  by  the  maximum  likelihood  estimates  of  the  parameters 
of  the  beta  distribution  (n  and  n)  displayed  in  Table  4  for  Model  0. 

These  two  shapes  are  displayed  in  Figures  1  and  2.   The  proportion 
of  consumers  who  make  one  or  more  purchases  of  Brand  1  constrain  the  purchase 


f(p) 


f(p) 


Figure  1:   m<n<l 


Figure  2:   m<l<n 


probability  distribution  to  take  one  of  the  two  shapes  shown  in  the  figures 
above.   To  detect  a  spike  on  these  purchase  probability  distributions  is 
going  to  be  difficult  as  is  illustrated  in  Figure  3.   In  fact,  many  researchers 
would  argue  that  Figure  3  is  essentially  equivalent  to  Figure  1.   On  the  other 
hand,  a  spike  at  p=0  would  be  easier  to  detect  for  the  case  shown  in  Figure  4. 


f(p) 


f(p) 


Figure  3 
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We  conclude  that  it  is  the  frequencies  for  non-zero  purchases  which 
determine  the  shape  of  the  purchase  probability  distributions.   Our  findings 
have  support  in  the  work  of  marketing  practitioners.  According  to  Butler  [2] 
and  Kropp  [6]  the  purchase  probability  distributions  for  mature  brands  are 
generally  described  by  U-shaped  distributions.   Stewart  [10]  in  an  empirical 
study  of  distributions  of  purchase  proportions  states  that,  "the  distribu- 
tion of  proportions  is  U-shaped  for  virtually  all  brands  in  each  field  irre- 
spective of  their  size."  Note  that  the  Aaker  data  which  is  for  a  new  product 
gives  a  steadily  decreasing  J  (or  non-U)  shaped  distributed.   Montgomery's 
data  on  the  established  Crest  brand  gives  two  U-shaped  distributions. 

CONCLUSION 

In  this  paper,  we  have  presented,  the  use  of  the  likelihood  ratio  test 
as  a  natural  method  to  estimate  for  the  presence  of  "always  buy"  and  "never 
buy"  consumers. 

Purchase  sequence  length  and  sample  size  requirements  were  determined 
through  the  use  of  simulated  data.   We  found  that  doubling  the  purchase 
sequence  length  from  5  to  10  gives  more  additional  power  to  detect  spikes 
rather  than  doubling  the  sample  size  from  500  to  1000.   In  other  words  for 
spike  detecting  power  it  is  useful  to  trade-off  an  increase  in  sample  size 
for  an  increase  in  purchase  sequence  length.   As  a  rule  of  thumb,  a  purchase 
sequence  length  of  10  is  needed  to  detect  spikes  especially  if  the  sample 
size  is  less  than   1000.   Not  many  published  studies  in  the  marketing 
literature  have  had  this  large  a  purchase  sequence  length.   Most  researchers 
have  used  purchase  lengths  of  5  or  less. 
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Our  findings  show  that  although  the  empirical  data  contained  more  than 
50%  consumers  who  had  never  bought  a  given  brand  the  specific  addition  of  a 
spike  to  provide  for  "never  buy"  consumers  did  not  form  a  superior  fit  over 
the  no-spike  model.   The  three  data  sets  used  here  provide  additional  support 
for  U-shaped  purchase  probability  distribution  found  by  some  marketing  prac- 
titioners.  However,  it  is  not  fair  to  conclude  that  these  three  data  sets 
included  no  "never  buy"  or  "always  buy"  consumers.   As  we  have  seen  the  pur- 
chase sequence  length  of  five  used  in  these  studies  is  inadequate  to  detect 
spikes.   We  merely  have  shown  that  a  beta  distribution  without  spikes  can 
fit  these  data  sets  as  well  as  the  more  general  distribution  that  includes 
spikes. 
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